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ABSTRACT 

Our recent determination of a Salpeter slope for the IMF in the field of 30 Doradus 
([Selman and Melnick 2005i) appears to be in conflict with simple probabilistic counting arguments 
advanced in the past to support observational claims of a steeper IMF in the LMC field. In this pa- 
per we re-examine these arguments and show by explicit construction that, contrary to these claims, 
the field IMF is expected to be exactly the same as the stellar IMF of the clusters out of which the 
field was presumably formed. We show that the current data on the mass distribution of clusters 
themselves is in excellent agreement with our model, and is consistent with a single spectrum hy 
number of stars of the type with /3 between -1.8 and -2.2 down to the smallest clusters without any 
preferred mass scale for cluster formation. We also use the random sampling model to estimate the 
sta tistics of the maximal mass star in clusters, and confirm the discrepancy with observations found 
bv lWeidner and Kroupal (|2006[ ). We argue that rather than signaling the violation of the random 
sampling model these observations reflect the gravitationally unstable nature of systems with one 
very large mass star. We stress the importance of the random sampling model as a null hypothesis 
whose violation would signal the presence of interesting physics. 

Subject headings: galaxies: evolution - galaxies: star clusters - galaxies: stellar content - Galaxy: 
stellar content - stars: formation - stars: mass function - star formation - initial 
mass function - IMF - Star clusters 



1. INTRODUCTION 

In a recent paper ("Selman and MclniclJ l2005l ) we mea- 
sured the Initial Mass Function (IMF) of the field 
in the 30 Doradus super-association and found that 
for 7 < ui/Mq < 40 the field IMFcaii_be char- 
acterized as a power law with the ISabeteil (|1955f ) 
slope. This result contradicts claims of a steep IMF 
for the LMC field (iMassev et a l. 1995; Massey 2002; 
iGouliermis et al.1 12005[ ). and lends support to the hy- 
pothesis of a universal IMF. However, the observation 
of an initial mass spectrum of the same slope in clus- 
ters and the field goes against the probabilistic count- 
ing arguments of [Van bevcrcn (1982) as interpreted by 
iKroupa and Weidnd i2003, henceforth KW2003). Fol- 
lowing Vanbeveren, KW2003 posit that if the field pop- 
ulation is entirely formed out of disrupted clusters, then 
the field IMF must be steeper because there are many 
more low mass clusters than massive ones, and low mass 
clusters cannot contain stars more massive than the clus- 
ters themselves. 

Although an extensive review of the literature is be- 
yond the scope of the present work, a brief tour will place 
it in its proper context, van Albada (1968b) built groups 
of stars by randomly sampling an IMF f{m)dm and gives 
the formulas for the general order statistics where the 
distribution funct ion of th e max imum mass star is one 
particular case . iReddishI (fT978h gives the formulation 
used by Vanbeveren, which appears to be one of the first 
references that gives the formula for the mass of the max- 
imal stellar mass as an integral of the IMF (Equation [TBI 



^ Recently lOev and Clarkj l l2005bl ) have shown using the ran- 
dom sampling model that the statistics of the maximal mass star 
in a number of OB association shows evidence of an upper mass 
limit in the range 100-200 Mq . 



below). iLarsonl (|1982( ) studied the correlation between 
the maximum stellar mass and the mass of the parent 
molecular clouds in star- forming regions. He noted that 
the observed correlation, M^ax* = 0-33M°j_^^^, could be 
explained by stochastic sampling of an IMF with Salpeter 
slope. To study dyna mical biasing (Ivan Al bada 1968a) in 
binary star formation iMcDonald and Clarke (1993. ) used 
a two-step process in which they sample stars assuming 
that a certain fraction g{N) of them come from groups 
of size N, and then sampled a stellar mass spectrum to 
build severa l statistics of binary s tars. The method was 
extended by lSterzik and DurisenI (jl998f ) to study the de- 
cay of gravitational few body systems. To build their 
clusters they introduced what they called a "two-step" 
approach which later lead them to t he "two-step initial 
mass function" ([Durisen et al.l 120011 ): first draw a clus- 
ter mass from a cluster mass-function, then draw enough 
stars from an stellar IMF to add to the cluster mass. The 
method presented here is similar, but with the important 
difference that we do not censor by mass, but we rather 
work with a cluster spectrum by number, and draw stars 
from a stellar mass function. This has the important 
consequence that we know by construction that there 
are no preferred mass scales other than those present in 
the stellar mass spectrum or the spect rum of clus t ers by 
number. A similar model was used bv lOev et al! (j2004( l 
to study the distribution of clusters by numbers in the 
SMC to conclude that the data for the high mass group- 
in gs studied is consis tent with an distribution. 

Va nbeverenI ()1982f ) using the then assumed Salpeter 
slope for the field stellar IMF concludes that "massive ag- 
gregates would contain more OB type stars than predicted 
by the Salpeter IMF. " Interestingly, KW2003 turn the 
argument around and use the well established Salpeter 
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form for the cluster stellar IMF for m > IMq to infer 
a slope steeper than Salpeter for the field stellar IMF in 
the same mass range. With the exception of the LMC 
work mentioned above, the evidence for a Salpeter slope 
for the fie ld stellar pop ulation is overwhelming, from the 
original , Salpeted ()1955l ) work on th e Milky Way to more 
recent work such as that of IScalol (|1986[) that steepen 
slightly the slope of the high mass end from 2.35 to 2.7 
(for recent reviews on this topi c the reader is referred to 
iKroupal 12002- Elm egreenl 12006'). Recently, Weidner and 
Kroupa (2006; henceforth WK2006) used an extensive 
set of Monte Carlo simulations to investigate the ques- 
tion of whether clusters could be constructed by sam- 
pling stellar IMFs using different sampling prescriptions. 
Their strong conclusion is that the model in which clus- 
ters are built by random sampling of a (Salpeter) stellar 
IMF is falsified by the statistics of the maximal mass star 
in clusters, stating: "With this contribution we demon- 
strate conclusively that the purely statistical notion is 
false, and that the stellar IMF is sampled to a maximum 
stellar mass that correlates with the cluster mass. " 

The purpose of this paper is to revisit this issue and 
to examine the question of whether clusters and the field 
sample a universal stellar mass distribution. We show 
that what really matters is the (poorly determined) clus- 
ter mass spectrum in the range of single star masses 
{M < ISOM©), and that it is more natural to work 
with the cluster "number of stars spectrum", P(n), the 
probability of a cluster having n stars. We address this 
question in two different ways: by studying it from first 
principles, and by actually doing Monte Carlo experi- 
ments building clusters randomly sampling a universal 
stellar IMF and comparing the results with the observa- 
tions. The random sampling model has no other physics 
in it than that input from the stellar mass spectrum and 
the cluster number spectrum. It should be considered as 
a null hypothesis for interesting physical processes: its 
violation signals the presence of interesting physics. Oc- 
cam's razor should be used with all models that violate 
the null hypothesis until strong observational evidence 
renders the model untenable. 

In Section [2] we present the formal framework for the 
subsequent analysis, and give an analytical parametriza- 
tion of the stellar IMF that agrees reasonably well with 
observations at all masses. In that section we also present 
an analytical relationship between the cluster mass func- 
tion and the stellar IMF. We use this relation to conduct 
Monte Carlo experiments to simulate the mass distribu- 
tion of clusters. In Section[3]we compare our simulations 
with the o bserved distribut i on of embedded clusters pre- 
sented by La da and Ladal ()2003l . henceforth LL2003). 
The claim by LL2003 that there is a preferred mass scale 
for cluster formation is not born out by our analysis, and 
a critical discussion to uncover the sources of this discrep- 
ancy is presented. In Sections 13.21 and l4l we challenge the 
view that all stars form in clusters and argue that our 
results favor a view where stars form, or at least acquire 
their final properties, before cluster formation. Section [5] 
summarizes our results and ends with the usual plea for 
more observations. 



2. BUILDING A FIELD POPULATION FROM CLUSTERS: 
THE FORMALISM. 



We will use the term population in the statistica l sense: 
a set with infinitely many elements ()Brandtlll998l ). Con- 
sider a population of stars with a Salpeter frequency dis- 
tribution of masses f{m). The mass m of the stars there- 
fore is a random variable with a frequency distribution 
f{m). Let us draw samples from such population with a 
fixed number of stars n and frequency distribution P{n)'^. 
Each of the samples will be called a cluster although 
such "clusters" can contain a single star. This construc- 
tion is analogous to those used in previo us work studying 
the pr operties of HII regions in galaxies (|Oev and Clarkd 
l2005al ). the more general study of Poissonian fluctuation s 
in population synthesis models bv lCerviho et al.l (|2002| ). 
and the analysis of the isolated massive stars in the Milky 
Way bv lde Wit etlHI (gOOS). 

The frequency distribution function of cluster masses 
will be given by, 

oo 

£_ci{M) ^ ^ J^„(mi,m2,---,m„)F(r7,),(l) 

Tl=l 

where J-n is the multivariate frequency distribution of 
masses for a sample of size n, and the summation is 
understood also as a multiple integral over all masses 
mi, • • • , nin satisfying the constraint that they add up to 
M (which imply quite a complex domain of integration). 
If the sample is random then the following two conditions 
are satisfied: 

(a) the individual must be independent, that is, 
Tn{mi,m2, • • • , m„) = /i(mi)/2(TO2) • ■ • /n(m„), (2) 

(b) the individual marginal distributions must be iden- 
tical and equal to the frequency distribution of the 
parent population, that is, 

/i(toi) = /2(to2) = • • • = /„(m„) = /(to). (3) 

We can write an explicit expression for the cluster mass 
function (we will use lowercase for stellar quantities and 
uppercase for cluster quantities). Because we consider 
only random samples, the variable M = toi -|- m2 -|- • ■ • -I- 
m„ is also a random variable. Thus, the distribution 
function of M, Fn{M), can be written as 

-|-oo 

F„{M) = J /(toi)/(to2) • • • /(m„)dTOi • • • dm(^] 

J\/<mi+m2H hm„<A'J + £iM 

We have used a somewhat unusual notation under the 
integral sign to indicate that the domain of integration 
is restricted to total masses between M and M + dM 
only. We can write this condition on the total masses as 
a Dirac delta-function in te rms of its Fourier expression 
orse and Feshbaci3[l95a 

SiM-J2^j)^— e-*(*^-^"^)*dt. (5) 

'^'^ J —oo 

In this paper the symbol P{x) stands for probability when the 
random variable x is a number (i.e. n), and for a the frequency 
distribution when x is a mass (i.e. m; M;m\Al). The meaning 
should be clear from the context. 
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This allow us to integrate over all positive mj and thus 
to avoid the problem posed by the difficult domain of 
integration: 

FniM)^— / e-^(^^-^'"^)*dt/(mi)/(m2)---/(m„)dmi 

n 



1 

2^ 



where (l){t) is the characteristic function of f{m), that is, 
the Fourier transform of the probability density: 



Thus, 



Fn{M)-- 



1 

2^ 



~iMt 



(9) 



(10) 



Finally, the cluster mass function (Equation 1) can be 
written as, 

CO 

^e/(M) = ^F„(M)P(n), (11) 

-1 p-\-oo 

UM) = — J_ e-^'"Y.P{n)r{t)dt (12) 



$(*) 

were P{n) is an arbitrary probability distribution of the 
number of stars in clusters. We see above that the char- 
acteristic function of the cluster mass function, <&(<), and 
the characteristic function of the stellar mass function, 
(j){t), are related as. 



■p{nW{t). 



(13) 



Equations [TTIfTSl form the basis for either an analytical, 
or a Monte Carlo approach to the statistical simulation 
of clusters. Its importance resides in that it relates the 
cluster "number of stars" distribution function, P{n), the 
(universal) stellar IMF, and the actual cluster mass func- 
tion. We will study a simple analytical case to illustrate 
its properties and proceed with full Monte Carlo simula- 
tions. Consider for example the case in which the cluster 
stellar mass function is simply /(m) = 6(rn — m^,), that 
is, a cluster with a single stellar mass species of mass . 
In this case (j){t) = e""**, and (?I)"(i) = e™™** from which 
we obtain F„(M) — 5{M — nm^,) as expected. More 
generally, we can use the relationship between cumulants 
and moments of a distribution (Kendall and Stuart 1977, 
p. 69) to determine that the mean mass of F„(M) scales 
with n, and its width scales with ^/n. 

We should notice that we have constructed a set of 
clusters with strictly the same mass spectrum as that of 
a field built by their total destruction. Since we can set 
P(n) to be any function, and in particular P{n) — for 
n < N for an arbitrary N, this important result holds 
independently of the lower cut-off in the cluster number 



spectrum: the stellar mass spectrum of clusters and of a 
field built entirely out of disrupted clusters can be strictly 
the same. At first sight this may seem to be an almost 
trivial result, but notice the subtlety revealed by the fol- 
loyf^j^f^gedanken experiment: sample a universal stellar 
IMF to create a sample of n clusters with different num- 
bers of stars according to P{n) and partition this sample 
acco:(^i|ng to their mass M to determine the stellar IMF 
conditioned to the parent cluster mass, P{m\M) (the 
probability that a star has mass m if the parent cluster 
mas^g^ M). For M in the range of single star masses, 
this probability is not independent of M and one would 
get the impression that the stellar IMF does depend on 
the cluster mass, that is, it is not universal. However, we 
know by construction that the clusters have their stars 
drawn from exactly the same stellar IMF; what depends 
on cluster mass is the conditional probability. The pur- 
pose of this paper is to investigate whether the obser- 
vations are in agreement with the P{m\M) that derives 
from the random sampling model, or whether they falsify 
it. Furthermore, we know from probability theory 



P{m) 



+ 00 



P{m\M)P{M)dM 



(14) 



that in the above construction we should recover the 
input stellar mass function. Thus, for given P{n) and 
P{m), P{m\M) and P{M) must conspire so that Equa- 
tion 14 is satisfied. Using this relation one finds for the 
simple case of the single mass species clusters with n stars 
that P{m\M) = 6{M — nm* — m A- m,) = 5{ni — to*), 
independent of M = tito*, as it should be. 

As it is shown in Section 13.21 this "conspiracy" is 
not present in other treatments of this problem, where 
P{m\M) and P{M) are taken to be totally independent. 
Because of this it makes more sense to work with the 
cluster "number function", P{n). 

Equations [TTIfHl are the fundamental relations relat- 
ing the stellar and cluster mass spectra in the random 
sampling model. Given the stellar mass function /(to) 
and P{n) they fix the form of the cluster mass function 
(,ci{M) and of P{m\M), which can then be compared 
with observations. 

2.1. Monte Carlo simulations 

Using the formalism described in the previous section 
we build clusters by randomly sampling the following 
"universal" stellar IMF, 



j^ag-(m/m2)'' 

dN (X - — — —dm 



2)7/2' 



(15) 



where 7, a, toi , TO2 and q are chosen to give the appropri- 
ate behavior at low and high masses: — l + r = a — 7 = 
-2.35, TOi = 0.3Mq, TO2 = I5OM0, g = 3. Figure 1 
shows this analytical stellar IMF together with the IMF 
of the Trapezium cluster (iHillenbrand and Carpenteil 
I2OOOI : iLuhman et al.ll206oHMuench et al.ll2002[ ). Our an- 
alytical formula departs from the observations at the 
higher masses because we have chosen to preserve the 
Salpeter slope for 1 < to/M© < ~120. Henceforth we 
will call this function the Salpeter IMF. 

We sample the Salpeter IMF to create clusters with n 
stars assuming a scale- free frequency distribution P{n) cx 
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Fig. 1. — The analytical stellar IMF that we use for our MC ex- 
periments compared with the IMF of the Trapeziu m cluster b y 
I Hillenbrand and Carpenter (200 3), steps : [Luhman et al.l (120001 ). 
segmented solid lines; and Muench et al.l 112002) . asterisks. 

n^, and we build the cluster mass spectrum using Equa- 
tion 10, 

oo 

71=1 

where F„(M) is the mass distribution function of clusters 
with exactly n stars. Notice that this process is scale-free 
only if the sum starts from n = 1, in which case the only 
mass scales of the problem are mi and niu , the lower and 
upper mass cut-offs of the stellar IMF. Our Monte Carlo 
simulations consist of repeatedly drawing n stars from 
the Salpeter IMF, calculating M — mi -I- • • • -I- m„ as 
the mass of a cluster with n stars, and then obtaining 

Fn{M). 

For the value of /3 there have been a multitude of 
studies of massive clusters in galaxies, which gives for 
the mass functions values ranging betwee n = —1.85 
(Ide Griis and Andersl[200l to /3 = -2.4 ([Hunter et alj 

mMore extensive referei ices are given in; ElmcgrccrJ 
. For smaller clusters Ide Wit et all (|2004, 2005) 
claim that their data on isolated massive star formation 
can be understood if (3 — —1.7. For massive clusters one 
can directly use the same exponent for the mass function 
as for P{n) because, as discussed above, the total mass 
scales with n and the width for fixed n scales with ^yri. 
In this work we will explore (3 = —1.8, —2.0, and —2.2. 

3. COMPARISON WITH OBSERVATIONS 

We have identified two observational tests that can be 
performed to check the validity of our null hypothesis. 
First, we will see if we can reproduce the form of the 
embedded cluster mass function; second, we will see if 
we can reproduce the statistics of the most massive star 
in clusters. We are aware that we are leaving out tests 
regarding the characteristics of small n multiple systems, 
which could falsify it^. But multiple systems, although 
numerous, are not the main source of stars in the field, so 
they will not affect the main conclusions of the present 

^ The study of the statistics of small n multiple systems is beyond 
the scope of the present work, but even here where observations 
of the frequency of high mass doubles appear to violate the sim- 
ple random sampling model, there are physical mechanism which 
explain them preserving t he model, namely, dynamical biasing(soe 
ISterzik and Durisen|[T99^ . and references therein). 



work, namely that the stellar and cluster field IMF can 
be the same. 

3.1. The embedded cluster mass function 

Due to the difhculty defining unbiased complete sam- 
ples, the important range of clusters masses in the regime 
of stellar masses is not well studied. There are nev- 
ertheless two relatively recent sources based on exten- 
sive surveys of the liter ature at the tim e of pu blication: 
iPorras et all (|2003) and lLada and Ladal (|2003). We pre- 
fer to use LL2003 four our analysis because they give 
estimates of the masses of the clusters, although only 4 
of the clusters in the Porras et al. list that satisfy the 
constraint on minimum number of stars of LL2003 are 
not included in this catalog. The cluster masses given 
in LL2003 were obtained by modeling source counts as 
a function of limiting magnitudes for two model clusters 
with ages of 0.8 Myr and 2 Myr, corresponding to the 
ages of the Trapezium and IC 348 clusters respectively. 
They assumed a universal IMF and used the average of 
the mass determined for the two assumed ages. 

Figure 2 shows the empirical data of LL2003 together 
with the results of six runs of our MC experiments in 
which we built clusters with the above P{n) for n > 35, 
drawing 72 clusters at the time (the parameters of the 
observations of LL2003). We note the excellent agree- 
ment between the simulations and LL2003 except for the 
mass bins at log M ^ 0.95 and log M ^ 3.5 which are to- 
tally de-populated in LL2003. For /3 — —1.8 the smallest 
mass bin is de-populated in 15% of our simulations while 
in 70% of the simulations contains 2 clusters or less. The 
highest mass bin is populated in only ~ 40% of the sim- 
ulations and in almost 100% of the simulations contains 
less than 2 clusters. LL2003 proposed that the down- 
turn at smaller masses was evidence for a favored cluster 
formation mass scale at around M ~ 5OM0. However, 
our simulations indicate that this downturn is naturally 
explained by the cutoff in n they introduced in an oth- 
erwise scale-free spectrum, without the need to invoke 
a special cluster formation scale. The figure shows that 
the data is best modeled if the cutoff in n is a bit larger 
than the LL2003 criterion of n > 35 to select clusters. 
This is probably the effect of having a sample with an 
inhomogeneous magnitude limit so that n > 35 becomes 
only a lower limit to the actual cutoff. 

3.2. The statistics of the maximal mass star in clusters 

WK2006 argued that their observed correlation be- 
tween the maximal star mass and the total mass of clus- 
ters is not consistent with the hypothesis that clusters are 
formed by random sampling of a universal stellar IMF. 
WK2006's sample of clusters is strongly affected by a 
size-of-sample effect (see Appendix IXj) . Because of the 
impracticality of finding a large ensemble of small clus- 
ters and thus avoid the problems introduced by the size- 
of-sample effect, WK2006 performed Monte Carlo ex- 
periments to determine the statistical properties of each 
of their three sampling methods. We were puzzled by 
their Figure 3, which shows that for their random sam- 
pling method (that corresponds to our Monte Carlo mod- 
els), the curves of maximal mass star versus cluster mass 
(Meci) have two maxima in the range between ~ 25M0 
and 250Mq. For example, for Med = lOOMg the 
curve peaks at 10 Mq and then again at ~ 100 Mq. 
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Fig. 2. — The Lada and Lada (2003) spectrum of masses of em- 
bedded clusters together with the results of Monte Carlo simula- 
tions in which we sample the stellar IMF with a number probability 
distribution /3 = —1.8, top row; (3 = —2.0, middle row; (3 = —2.2, 
bottom row. For n > 35, left column; and n > 70, right column. 



Since it is precisely in this mass range that the model 
curve departs most strongly from the data points, we 
thought that the double peaks could be the result of a 
computational error. We therefore decided to repeat the 
calculations using our independent algorithm to build the 
bivariate (maximal stellar mass - cluster mass) probabil- 
ity distribution. The results of our simulations are shown 
Figure O where, much to our surprise, we reproduce the 
double peaks obtained by WK2006! 

Figure 3 shows the results of many MC experiments 
of the random sdampling model from which we have cal- 
culated the bivariate probability distribution to have a 
cluster in a given log-mass bin of width 0.5, with a star 
of maximal mass in a log-mass bin of width 0.1. Lighter 
areas correspond to a higher probability. Figure 3a cor- 
respond to MC experiments in which anything is consid- 
ered a cluster, even system with n=l. Figure 3b consid- 
ers only cluters with n>50. The vertical line in Figure 3a 
marks the position of logM — 1.8. Notice that as one 
moves from bottom to top along this line one will cross 
contour levels that at first increase until the bivariate 
distribution reaches a maximum at logm^^^ « 0.9. If 
one continues to move it will reach a minimum at ap- 
proximately logm^^^ « 1.4, and then it starts increas- 
ing again reaching a maximum at the point in which 
all the cluster mass is in a single star. Notice that this 
double (local) maxima feature comes from the nature of 
the probability distribution of star masses conditioned to 
cluster mass ((Fig SI see below). 

Interestingly, the c luster s from the compilation of 
ner and Kroupal (|2006t l ( crosses in Fig. [3|) all con- 
centrate in the ridge of the distribution defined by 
the first (lower mass) peak described above and do 
not cover the full mass range allowed by our models. 
To a lesser extent this is also true of the models by 
IWeid ner and Kroupc] (|2006D . whose Fi gures 4 and 5 show 
the data to have a much smaller dispersion around the 



mean than the models. Nevertheless, the random sam- 
pling method deviate most from the data due to its "dou- 
ble peaked" mass distribution. With our preferred ran- 
dom sampling method clusters with masses in the stellar 
mass range the most massive stars can have masses sim- 
ilar to the total cluster mass. Moreover, the conditional 
probability distribution of stellar masses, P{m\M), for 
clusters in the stellar mass range shows that some clus- 
ters can be dominated by one or at most a few high mass 
stars (Fig . This increase in the probability distribu- 
tion of stellar masses near the clust er mass is also visible 
in Figure 2 of iDurisen et al.l (|2001h . so the critical ques- 
tions are whether this effect is real, and if so, whether 
it is significant. The fact that the effect is seen in three 
independent investigations argues strongly for the reality 
of the peak in the random sampling model, but its sig- 
nificance is debatable. On the one hand, the effect arises 
from partitioning the data into mass bins, and it is forced 
into existence by the need to satisfy Equation [HI so it 
is of no physical significance. On the other hand, it bi- 
ases (toward large values) the mea n maximal stellar mass 
versus cluster mass curve used bv IWeidner and Kroupal 
(|2006| ) to falsify the random sampling model, so it is 
highly significant. 

Is this a real violation of our null hypothesis signal- 
ing the presence of some interesting physical effect or is 
it the effect of improper data, or its analysis, or both? 
Although WK2006's conclusions are based on a very lim- 
ited data-set*, it is unlikely that this alone can explain 
the difference in the distribution of the data points and 
that predicted by the models: the number of clusters in 
each mass bin is small but the total sample is not that 
small and at all masses the data points are delineating 
the lowest maximal stellar mass ridge of the distribution. 

One possible explanation could come from the highly 
hierarchical nature of young stellar systems and the 
somewhat arbitrary way in which the parent object of the 
maximal star is chosen. For example, the cluster Wester- 
lund 2 has the well known massive binary WR20a, both 
components of which with masses ~ 80Mq (jRauw et al.l 
[2004, 2005). Considering that the ratio of its components 
separation to the cluster size is smaller than the ratio of 
the cluster size to that of the Milky Way, there is no 
objective reason not to have such binary star as a single 
point in Figure [31 in which case we would have a data 
point in the area devoid of points in that Figure. An ob- 
jective algorithm to identify clusters of different number 
of stars is needed . A step in that direction is t he work of 
lOev et all (|2004D which used the algorithm bv lBattineHH 
( 19911) to identifv groupings of stars and showed that the 
distribution of clusters by number of stars is consistent 
with ri~^ down to single stars. What is needed is then to 
determine the maximal mass star and total mass of those 
clusters to build the bivariate probability distribution. 

Finally, another explanation is that these data do in- 
deed falsify our null hypothesis and that they signal the 
presence of some interesting physical effect that results in 
the mass of the most massive star to depend on the mass 
of its parent cluster. One possibility could be that stars 

^ For example, their favore d sorted samp l ing model p redicts no 
single star clusters at all, while'de Wit et al.l ||2OT , [20051) find truly 
isolated ma ssive stars of spe ctral types ranging between 05 and 
09 (see also lZinnecker and Yorkaii2007 ). None of these "single star 
clusters" are included in KW2006. 
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Fig. 3. — (a, top) The bivariate probability distribution function 
of maximum stellar mass and cluster mass. T he overlaid points 
correspond to the data in lWeidner and Kroupal (120060 . This figure 
show the result with a cluster P{n) ~ starting at n=l. The 

grey levels correspond to the bivariate probability of finding a clus- 
ter in a log-mass bin of size 0.5 X 0.1. The white circle represents 
WR20a taken as a system. The points represent the data points in 
WK2006 with a few additions from Weidner (2007). The vertical 
line is drawn at a log M value of 1.8. (b, bottom) Same as (a) but 
with n > 50. For details see main text. 



form in an ordered fashion, less massive stars forming 
first. Once a high mass star is formed this one cleans the 
cluster of its placental materi al and the cluster rapidly 
disintegrate (|Elmegreenlll983l ). This is the explanation 
favoured by WK2006. But there is another possibility: 
clusters with maximal mass star of too large masses can 
be more gravitationally unstable. A cluster with a max- 
imal mass star of very large mass is characterized also 
by having a smaller total number of stars, and its con- 
ditional s t ellar mass function is flatter (see Figure |4|). 
iTerlevichI (|1987[ ) found that a flatter mass spectrum re- 
sults in a considerably smaller half-life: her model XV 
evolves an order of magnitude faster that a cluster with 
a normal Salpeter IMF. This explanation has the feature 
that it does not violate the null hypothesis, because clus- 
ters are formed according to the random sampling model 
but their lifetimes, and thus the probability of observing 
them, depend on the mass of the maximal stellar mass 
in them. 




Fig. 4. — Conditional probability distributions of stellar masses 
conditioned to cluster mass. The topmost curve correspond to 
P(m) where the cluster mass has bin marginalized. The other 
curves correspond from top to bottome to P(m — M) for log M 
equal to -1.25, -0.75, -0.25, 0.25, 0.75, 1.25, 1.75, 2.25, 2.75, 3.25, 
3.75, and 4.25 respectively. 

4. DISCUSSION 

As mentioned in the Introduction, our results depart 
radically from those of KW2003. Their conclusion that 
the field built by clusters must have a steeper stellar IMF 
can be critic ized on several acc ounts as follows. To begin, 
as noted by lElmegreenI (|2006[ ) , for the observed range of 
cluster masses, the predicted steepening in the stellar 
IMF is rather small going from F — —1.35 in clusters 

to F 1.5 in the field (Figure 2 in KW2003). This is 

well within the observed variance in the stellar IMF of 
clusters, and can be comfortably ascribed to sy stematic 
error s in the determination of the slope (see e.g. iKroupal 
[200l . 

A second problem is that KW2003 implicitly assume 
that the cluster mass spectrum is well determined down 
to masses comparable to those of the smaller stars and is 
approximately given by ^c; ~ Af~^ for 5 < M/Mq < 10^. 
As mentioned in the previous section, however, LL2003 
claim that the cluster mass spectrum turns abruptly 
down for masses below M ~ 50Mq, that is, it appears 
to have a preferred mass-scale. This reduces the number 
of small mass clusters thus reducing the predicted differ- 
ence between cluster stellar IMF and the field. Almost 
paradoxically, our model invoking a universal stellar IMF 
shows that this preferred mass-scale is most likely not a 
physically significant feature of cluster formation! 

Another problem is that KW2003 use a Procrustean 
approach in their modeling of clusters where all clusters 
are forced to have one maximal mass star, that is, a star 
of the maximum mass allowed by the stellar IMF. This 
assumption. 



1 = 



f{m)dm, 



(16) 



forces the upper mass cut-off of the IMF to be an increas- 
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ing function of cluster mass, varying as 

jV/i/=^. From the 

discussion leading to Equation [Ml it is clear that we can 
not recover the input IMF unless we include in our sums 
cl usters for which m — M . 

lElmegreenl (j2006) expanded the formalism of Vanbcv- 
eren and showed analytically and numerically that for a 
power-law mass distribution of clusters of slope (3 < 2, 
the summed IMF of a population of clusters is indistin- 
guishable from the cluster IMF. However, the functional 
form of the conditional probability P{m,\M) adopted by 
Elmegreen does not satisfy Equation [14] for any value of 
/3; it just happens that for /? < 2 the summed IMF is 
almost the same as the cluster IMF (they differ by a log- 
arithmic multiplicative factor). This clearly shows that 
the finding of KW2003, that for /3 > 2 the summed IMF 
becomes significantly steeper than the individual cluster 
IMF, arises from the assumption that, besides a normal- 
ization factor that depends on the cluster mass through 
Equation [ini P(m) and P{m\M) have the same func- 
tional form, (simple power-laws of the same slope in the 
case of Elmegreen). While this is a very good assump- 
tion for very massive clusters, it clearly does not apply 
for clusters in the stellar mass range which are the ones 
responsible for "tilting" the sum-IMF for (3 > 2. Thus, 
even within the Vanbeveren formalism the IMF of clus- 
ters and the field can be strictly the same for any cluster 
mass distribution. 

The results of WK2006 are confirmed in this work in 
the sense that we also find that real clusters cover a sig- 
nificantly smaller part of the maximal stellar mass and 
cluster mass space than permitted by the random sam- 
pling model. WK2006 go then to modify their sampling 
algorithm in such a way as to reduce the probability 
of having very large mass stars in their clusters argu- 
ing that "Star clusters appears to form in an ordered 
fashion, starting with the lowest-mass stars until feedback 
is able to outweigh the gravitationally induced formation 
process. " Although this is a possible scenario we favour 
a different one in which the area of the bivariate distribu- 
tion allowed by the random sampling model is rendered 
unstable due to the extreme nature of the mass spectrum 
therei n, in accordance with the simulations of iTerlevichI 
(|1987t l. How much "trimming" of the bivariate distribu- 
tion can be actually accomplished this way will be the 
subject of a future work. 

Our strong conclusion is that the observations of the 
stellar IMF and the mass spectrum of young clusters are 
consistent with the hypothesis that clusters form by ran- 
dom sampling of a universal stellar IMF. This conclusion 
leads us to challenge the received hypothesis that clusters 
are the fundamental building blocks of the s tellar pop- 
ulations in galaxies (|de Wit et al.ll2004 [2005). In this 
view clusters are given an independent existence from be- 
fore the time that stars form. In our view, following the 
ideas of Elmegreen (1997), stars form in giant molecular 
clouds in a hierarchy of structures with different numbers 
and masses. Some of these structures end up forming 
large clusters (which will later dissolve) and some don't: 
they become part of small associations of stars formed in 
neighboring regions almost by chance. In fact, the ob- 
servations of de Wit et al. (2004, 2005), that are used as 
standard references for the view that clusters form first, 
find that 30% of young massive Galactic field stars are 
not members of clusters or OB associations. Of these. 



about 50% are runaway star candidates. They conclude 
that 4±2% of the stars in their sample result from truly 
isolated high-mass star formation, a number that can be 
reproduced "assuming that all stars are formed in clus- 
ters that follow a universal cluster distribution (by N^) 
with slope f3 ^ —1.7 down to clusters with a single mem- 
ber. " If we consider single star clusters the statement 
that all stars are formed in clusters becomes a tautol- 
ogy. 

Our results hint at a strong universality hypothesis for 
the IMF where not only the power-law part, but the 
full function may be universal. Clearly this claim has 
profound implications for understanding how stars form 
and therefore its foundations require considerably more 
observational work than was available for the tests pre- 
sented in this paper. 

5. SUMMARY 

This paper combines four separate results within a sin- 
gle unified view. The unification is actually a result of 
Equation [TT] (that is derived formally in the first section 
of the paper) which relates the cluster mass function with 
the probability distribution of number of stars in clusters, 
and the (universal) stellar IMF. These results are 

• the IMF of a field stellar population and that 
of the clusters out of which the field was built 
can be strictly th e same. This result contra- 



dicts [Krou2Z^nTWeidner i2003D . and was implicit 
i n the models bv [LarsonI ( 19821. Oev and Clarke! 
(l2005al lbh. and lde Wit etal.. (2004); 



• the observations of th e lower end of th e clust er mass 
function, as given bv lLada and Ladal (|2003D . agree 
with the random sampling model presented here 
if: (a) the distribution of clusters by the number 
of stars they contain is a scale- free power law, , 
with l3 between -1.8 and -2.2; (b) the stellar IMF 
is independent of n and it is given by the Salpeter 
form; 

• the observed special mass scale for cluster forma- 
tion claimed by Lada and Lada arises from the ar- 
bitrary cut-off in the number of stars imposed by 
them; 

• the interpretation of the statistics of the most mas- 
sive star in clusters is a valuable tool to study clus- 
ter formation processes as the observations, taken 
at face value, violate the null hypothesis repre- 
sented by the random sampling model. Although 
a proper observational study requires a sample in- 
cluding many clusters with only a few members, we 
believe that the observations presented by WK2006 
are at worst compelling. Nevertheless, it is argued 
in the present work that the discrepancy is due to 
systems which are rendered gravitationally unsta- 
ble by the presence of one or more very massive 
stars. 



We would like to thank our anonymous referee whose 
comments helped us to improve this work. 
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Fig. 5. — The maximum stellcir mass in a sub-sample of the data of WK2006 analyzed as detailed in the main text. The x-axis show the 
average mass of the clusters in the 800 Mq "super clusters" , while the y-axis show the mass of the maximal mass star in the super clusters. 

APPENDIX 

SIZE-OF-SAMPLE EFFECT 

The size-of-sample effect arises in many areas of astronomy, and we have studied it in the context of the distribution 
of sizes of super-associations in galaxies (Selman and Melnick, 2000). The formalism developed there can be translated 

mutatis mutandi to the present context as follows: Let the whole set of star clusters to be analyzed be denoted by 
C = {CjjiLi, where Cj is the i-th cluster with mass Mi and maximum stellar mass w"'"'^*. We will assume C to be 
ordered from the most massive to the least massive cluster, that is, i < j => Mi > Mj 



average mass of the I most massive clusters. From C we will draw Ns sub-samples, Sj 



Let Ml = Y,Ui Mi/l be the 
of equal mass. Mi, defined by 



where rij is defined by the expression. 



= {Ci}Zj, 



Y,Mi = Mi. 



Thus, the sub-sample Sj contains the cluster Cj and the next iij — 1 less massive clusters, enough to add up to a total 
mass equal to M;. We will assign to each sub-sample j two numbers: m™°^*, and M""^, defined as 



1 



3 + 



is equal to the maximum stellar mass of all the members of iS,- 



and Mp 



their average mass. We will refer 



to sub-sample j as "super-cluster" j. Because all "super-clusters" thus defined have approximately equal total mass 
(rs Ml), we can compare the mass of their maximal star without having a size-of-sample effect. 

Regrettably, the data set and the metod of analysis used by WK2006 is far from what is needed for this kind of 
analysis. Their Table 1 lists 17 clusters with masses ranging from 2^Mq to 10^ Mq. We have seen that we should 
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actualy work with the number of stars instead of the mass of the clusters as conditioning to cluster mass introduces 
unphysical mass scales. If we use an average stellar mass of 0.3Mq then the smallest mass cluster corresponds to a 
cluster with « 75 stars while the largest mass clusters corresponds to « 3 x 10^ stars. The sample should include 
at least 4000 clusters with 75 stars to meaningfully compare the maximal mass of this artificial "super-cluster" with 
the maximal mass of a cluster of IO^Mq (such as R136). Figure [5] plots the maximal mass against total mass for the 
best sub-sample of "super-clusters" that can be constructed from the data of WK2006 using the algorithm described 
above. This sample consists of NGC6530 as the first "super-cluster"; NGC 2264, Mon R2, and a Ori in the second; 
Mon R2, a Ori, NGC 2024, and IC 348 in the third; and a Ori, NGC 2024, IC 348, p Oph, NGC 1333, Ser SVS2, 
and Taurus- Auriga in the fourth. Each of these four "super-clusters" has a total mass of approximately 800 M0. 
The "best-set" shows no correlation between maximal star mass and cluster richness: the claimed correlation was a 
size-of-sample effect. (It is possible to construct other sub-samples having a more massive star in the upper mass bin, 
but these sub-samples contain only 2 or 3 super-clusters.) 
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